Ecological Inference and the Ecological Fallacy

Author

  • David A. Freedman
Abstract

In 19th century Europe, suicide rates were higher in countries that were more heavily Protestant, the inference being that suicide was promoted by the social conditions of Protestantism (Durkheim 1897; also see Neeleman and Lewis 1999). According to Carroll (1975), death rates from breast cancer are higher in countries where fat is a larger component of the diet, the idea being that fat intake causes breast cancer. These are ‘ecological inferences,’ that is, inferences about individual behavior drawn from data about aggregates. To continue with Durkheim, the Protestant countries were different from the Catholic countries in many ways besides religion (the problem of ‘confounding’). Moreover, Durkheim’s data do not tie individual suicides to any particular religious faith. The first problem, of confounding, must be dealt with in any observational study. But the second problem—that exposure and response are measured only for aggregates rather than for individuals—is specific to ecological studies. If there is no confounding, the expected difference between effects for groups and effects for individuals is ‘aggregation bias’; in general, the difference is partly attributable to confounding and partly to aggregation bias. The ecological fallacy consists in thinking that relationships observed for groups necessarily hold for individuals: if countries with more Protestants tend to have higher suicide rates, then Protestants must be more likely to commit suicide; if countries with more fat in the diet have higher rates of breast cancer, then women who eat fatty foods must be more likely to get breast cancer. These inferences may be correct, but are only weakly supported by the aggregate data. Ecological studies in epidemiology yielded important insights for Snow (1855), Finlay (1881), Goldberger (Terris 1964), and Dean (1938) among others. However, it is all too easy to draw incorrect conclusions from aggregate data. Greenland and Robins (1994) review the issues. 
For one example, recent studies of individual-level data cast serious doubt on the link between breast cancer and fat intake (Holmes et al. 1999). Another well-known example, on the sources of popular support for the Nazi party in pre-war Germany, is discussed by Lohmoller et al. (1985).
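The gap between group-level and individual-level relationships can be illustrated with a small numerical sketch. The figures below are invented for illustration and are not Durkheim's data; the function names and rate formulas are assumptions chosen so that suicide rates rise with a country's Protestant share for everyone living there, while Protestants have a slightly lower rate than Catholics within every country.

```python
from math import sqrt

# Hypothetical Protestant share of the population in five made-up countries.
shares = [0.1, 0.3, 0.5, 0.7, 0.9]

# Assumed rates per 100,000. Both rise with the country's Protestant share
# (a country-level effect), but the Protestant rate is always 2 lower.
def protestant_rate(share):
    return 10 + 20 * share

def catholic_rate(share):
    return 12 + 20 * share

def country_rate(share):
    """Aggregate rate: the only quantity an ecological study observes."""
    return share * protestant_rate(share) + (1 - share) * catholic_rate(share)

def pearson(xs, ys):
    """Plain Pearson correlation coefficient."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

rates = [country_rate(s) for s in shares]

# Group-level relationship between Protestant share and suicide rate.
ecological_r = pearson(shares, rates)

# Individual-level comparison within each country (Protestant minus Catholic).
within_gaps = [protestant_rate(s) - catholic_rate(s) for s in shares]

print("country-level correlation:", ecological_r)
print("within-country gaps:", within_gaps)
```

Under these assumptions the country-level correlation is strongly positive, yet within every country the Protestant rate is lower than the Catholic rate. Concluding from the aggregate pattern alone that Protestants are more likely to commit suicide would be the ecological fallacy in this constructed case.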

Similar resources

The fallacy of the ecological fallacy: the potential misuse of a concept and the consequences.

Ecological studies have been evaluated in epidemiological contexts in terms of the "ecological fallacy." Although the empirical evidence for a lack of comparability between correlations derived from ecological- and individual-level analyses is compelling, the conceptual meaning of the ecological fallacy remains problematic. This paper argues that issues in cross-level inference can be usefully ...

Estimating the likelihood of fallacious ecological inference: linear ecological regression in the presence of context effects

ABSTRACT. In situations where the only reliable data source is electoral data at the aggregate level for a geographic unit such as voting precincts, social scientists have sought to use ecological regression techniques to recreate the voting behavior of particular groups without committing the ecological fallacy of the sort warned of by Robinson (1950). Until quite recently, the most common use ...

The individualistic fallacy, ecological studies and instrumental variables: a causal interpretation

The validity of ecological studies in epidemiology for inferring causal relationships has been widely challenged as observed associations could be biased by the Ecological Fallacy. We reconsider the important design components of ecological studies, and discuss the conditions that may lead to spurious associations. Ecological associations are useful and valid when the ecological exposures can b...

Using census data to investigate the causes of the ecological fallacy.

"The authors show how data from the 2% Sample of Anonymised Records (SAR) can be combined with data from the Small Area Statistics (SAS) database to investigate the causes of the ecological fallacy in an Enumeration District (ED) level analysis. A range of census variables are examined in three 'SAR districts'...in England. Results of comparable analyses from the 1986 Australian census are also...

Publication date: 1999